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In the circuit model, quantum computers rely on the availability of a universal quantum gate set. 
A particularly intriguing example of such a set is the "matchgates" along with SWAP, the simple 
exchange of two qubits. In this paper, we show a simple decomposition of arbitrary matchgates 
into better known elementary gates, and implement one matchgate in a linear optics experiment 
using single photons. We characterize the gate performance via quantum process tomography and 
represent the resulting quantum process in a novel way, as a fidelity map in the space of all possible 
nonlocal two-qubit unitaries. In addition, we propose a new non-local, diagnostic process measure. 



In quantum computation, an essential requirement of 
the circuit model is a universal gate set, which enables 
the approximation of any given unitary process to arbi- 
trary precision [H . The advantages and disadvantages of 
various gate sets are still being actively explored; for ex- 
ample, one set may be more natural than others for inter- 
preting a certain problem, or, in a given physical architec- 
ture one set may require far less resources than another. 
The best known gate-set class is that of any entangling 
2-qubit gate in combination with arbitrary single-qubit 
unitaries 0: most famously the 2-qubit CNOT gate in 
conjunction with the 1-qubit Hadamard, H, and phase, 
T, gates. Circuits constructed of solely the CNOT and 
H gates can be simulated efficiently with a classical com- 
puter However, the addition of the T gate — itself also 
efficiently simulatable — enables universal quantum com- 
puting (which of course is generally believed not to be ef- 
ficiently simulatable). Another important gate-set class 
uses 3-qubit entanglers, such as the Toffoli gate — which 
has only recently been demonstrated in linear optics Q 
and ion traps Q — along with H. 

In this paper we demonstrate a new gate-set class 
based only on 2-qubit gates, specifically the matchgate 
0, which can be entangling, and the SWAP gate, which 
is strictly non-entangling [6|j7|- Matchgates — originally 
introduced in graph theory [5| — are 2-qubit unitaries. 
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where A and B, the 1-qubit unitaries formed by 
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are members of U{2) with det{A)—det{B) , and act on the 
even and the odd 2-qubit parity subspaces respectively 
[2^ . Matchgates include a rich number of entangling 
gates — including maximal entangling gates — as well as 



many classes of local gates. If matchgates act only be- 
tween nearest-neighbour qubits, then the resulting cir- 
cuit can be efhciently simulated classically In this 
context matchgates relate to systems of non-interacting 
fermions [6l|. Moreover, they are connected to ID quan- 
tum Ising models: explicit matchgate circuits for simula- 
tions of such strongly-correlated quantum systems have 
been constructed in [8|. However, if matchgates are also 
allowed to operate between nea;t-nearest neighbours — a 
seemingly trivial change achieved via SWAP gates — then 
the circuit can perform universal quantum computation 
[jI. Clearly, the resulting universal gate set is entirely 
different from the two mentioned above. 

Here we show how to realise arbitrary matchgates us- 
ing circuit elements already demonstrated in a range of 
physical architectures; we go on to demonstrate and mea- 
sure matchgate operation in a linear optics photonic sys- 
tem, quantifying their performance with a new method 
for experimental analysis of two-qubit gates. 

Fig.[T^) shows the dccompostion of an arbitrary match- 
gate, Gab, into CNOT, controlled- unitary and single-qubit 
gates. Recall that, depending on the parity of the in- 
put state into the matchgate, either A or B acts on the 
two qubits. The first step of our general decomposition 
is to encode the parity of the two-qubit state onto one 
of the qubits. The first CNOT gate turns the bottom 
qubit into state |0) for even-parity inputs (|00)-^|00) & 
|11)^|10)) and into |1) for odd-parity inputs (|01)^|01) 
& |10)— >|11)). This qubit then acts as the control for 
the controlled unitary, CU [i^, where u=BA~-'^. If the 
bottom qubit is in state |0) (parity=0), A will act on the 
top qubit; if it is in state |1) (parity=l), CU will undo 
A before it performs B. The final CNOT gate returns the 
qubits from the parity encoding to the original basis. 

Note that, alternatively, any 2-qubit operation could 
be implemented_with three CNOT gates and 8 single- 
qubit unitaries or two -gates (yet to be experi- 
mentally demonstrated) and 6 single-qubit unitaries ■ 
Our decomposition, which offers a starting point for fur- 
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FIG. 1: a) Decomposition of general matchgates Gab- b) 
Simplified decomposition of symmetric matchgates Gaa as de- 
scribed in the text, c) The same gate after flipping the CNOTs 
with Hs and decomposing HAH into x- and z-rotations. d) 
Further simplification shows that Gaa, up to a global phase, 
can be implemented using a single CZg and local unitaries 
Vi=HXa and V2=z_fl/2X/3H where a, /3, and 9 are related to 
A via HAH=XcZ_£)/2X^. For Ghh, 6=-k, and CZe CZ. 



thcr simplifications, in contrast requires fewer gates and 
allows one to recast matchgate circuits (e.g. those in [§|) 
into circuits consisting of more familiar quantum gates 
— the CNOT nl, , the more general CU Q i ^ind sin- 
gle qubit rotations, all of which have been individually 
implemented in various architectures. 

We now show how to build the matchgates required 
for a universal gate set. The universality proof in 
relies on showing how matchgates and swap can 
be applied to implement another universal set — H, T 
and CZ. As outlined in the additional online material 
[2^ . each logical qubit has to be encoded in two (or 
four) physical qubits. The required matchgate set is 
then Gxx, Gtt and Ghh- These gates are all symmetric 
i.e. A=B, and the circuit of Fig. [T^) is greatly sim- 
plified, because A~^A=i: the controlled unitary turns 
into a "controlled identity". The resulting circuit dia- 
gram, shown in Fig.[T]D), still requires two 2-qubit gates. 
It does however resemble the general construction of a 
CU [13] gate, i.e. it can be replaced by a single CU 
and 4 single-qubit unitaries in the following way: We 
flip the CNOTS upside down by adding 4 Hadamards — 
CNOT=(h(Kih)xcnotx (H(g)H) — and rewrite the resulting 
central unitary HAH, up to a global phase, as y^a.'L-e ji^fi 



[H, Fig. [It). The rotations Xq and commute with 
their respective CNOTs, which allows us to express the 
two-qubit part of the operation as a single controlled CZg, 
Fig. [TJi). This simplified circuit for Gaa can now be di- 
rectly implemented in bosonic systems using the tech- 
nique of shortcuts through higher-dimensional Hilbert 
spaces [3)]. 

Turning back to the universal matchgate set, we find 
that Gxx=X(8'X and Gtt=T(8'I — both operations are lo- 
cal and can be done trivially in a photonic architecture 
with waveplates or interferometers. Similarly, swap is a 
straightforward procedure in optics, either in free-space 
or integrated circuits. The one non-trivial gate to be 
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which is a nonlocal, maximally entangling gate. For this 
gate the decomposition yields d~-K and therefore CZg in 
Fig- [TJi) is the well known CZ gate- Experimentally, we 
can therefore implement Ghh using polarization-encoded 
single photons, partially polarizing beam splitters and co- 
incidence detection 14, 15, 16]- The experimental setup 
is explained in Fig- [2 
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FIG- 2: Experimental scheme- Orthogonally polarized pho- 
ton pairs are created in a nonlinear ppKTP crystal which is 
pumped by a 410 nm laser diode, using focussing parameters 
from [13]- The photons are split at a polarizing beamsplit- 
ter (PBS), collected, and guided to the circuit with single- 
mode optical fibers. Their input polarizations are set with 
a PBS, one quarter- and one half- wave plate (QWP, HWP) 
(Prep.). They are then superposed on a partially polariz- 
ing beamsplitter (PPBS) which transmits 2/3 of horizontal, 
l-ff) and perfectly reflects vertical, \y) light. Loss elements 
(PPBS') correct the respective amplitudes and the unknown 
phase shift at the central PPBS is compensated by a combi- 
nation of a QWP, a HWP and another QWP in one output 
port of the PPBS. If compensated correctly, only the input 
term \HH) picks up a phase shift of tt due to non-classical 
interference, which realizes a bit-flipped CZ when measured 
in coincidence. The photons are then jointly analyzed with a 
QWP, a HWP and a PBS (Tomo.) before they are detected 
by two single-photon detectors. The rotations for the 4 single- 
qubit unitaries required for Ghh in Fig. [TJi) are incorporated 
into the preparation and measurement waveplate settings. 



We characterized our gate using full quantum process 
tomography [l8| . preparing 16 combinations of the states 
m, \V), \D)=l/V2{\H) + \V)), \R)=l/V2{\H)+^\V))} 
at each input and projecting into an overcomplete set of 
36 measurements at the outputs. The photons were then 
detected in coincidence. The resulting process matrix 
Xexp^ reconstructed via maximum- likelihood estimation, 
is shown in Fig. [3] The process fidelity with Xideai is 



92.3 ± 0.2%, where the error was calculated assuming 
Poissonian count statistics. We attribute the remaining 
errors to non-ideal waveplates, imperfections in the 
spatial and temporal mode-overlap, as well as in the 
splitting ratios of the PPBS's. 
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FIG. 3: a) Real and b) imaginary parts of the Ghh process 
matrix x^^p reconstructed from quantum process tomography 
measurements. The wireframes represent the ideal process 
Xideai- The proccss fidelity of Xexp with Xtdeai was 92.3±0.2%. 
We did not apply any numeric local rotations to optimize this 
result in contrast to, e.g. 0]. 



The question remains: how well we have actually im- 
plemented a matchgate? A single process fidelity, such as 
calculated above, reveals the overlap between the exper- 
imental process and the corresponding, ideal target pro- 
cess. However, it does not yield any information about 
the way in which the process is not ideal — is it just mixed 
due to random noise or are we in fact implementing a dif- 
ferent unitary process than we actually thought? In par- 
ticular, we are interested in the nonlocal properties of the 
quantum process, as they define its entangling power and 
errors in them cannot be corrected with local operations. 

Interestingly, as shown in [l^ . out of the 15 real param- 
eters which define a unitary 2-qubit operator U€SU{A), 
only three actually describe the nonlocal part of the uni- 
tary, the remaining 12 relating to local transformations: 

[/=(ui®wi)e^(=i'"-®'"-+'=^'"«®'"«+=3'"-®'"-)(u2®t;2). (4) 

This decomposition allows a very intuitive geometri- 
cal representation of local equivalence classes. The three 
nonlocal parameters ci, C2 and C3 can be used to con- 
struct a 3-dimensional space of nonlocal gates. Sym- 
metries reduce this space to a tetrahedron, shown in 



Fig. |4] (a) , the so-called Weyl-chamber [2^ . Each point in 
this chamber represents all locally-equivalent gates with 
unique nonlocal properties [i^l; the CNOT gate, CZ, and 
therefore also Ghh gate, are all locally equivalent and lo- 
cated at [7r/2,0, 0]. Using a method from [2^, one can 
directly check whether two gates are locally equivalent, 
which confirms the results obtained from our matchgate 
decomposition in Fig. [1] In order to illustrate our ex- 
perimental process in the Weyl chamber and to find the 
nonlocal unitary gate closest to it, we numerically trans- 
lated 6201 evenly-spaced ideal 2-qubit operators defined 
by (H]) into their process representation and then calcu- 
lated their maximal nonlocal process fidelity, 

•^n;(ci,C2,C3) = max J^(Xexp,X(ci...C3,Ui...t)2)). (5) 

Ui . ..V2 

to Xexp by numerical optimization over the local trans- 
formations and U2,V2- The result is a three- 
dimensional process fidelity map, shown in Fig. 
and c) for the ideal and experimental processes respec- 
tively. After optimization over the local unitaries, we 
find a maximum process fidelity for our experimental 
gate of 94.7±0.3% (increased from 92.3%, Fig. ^ at 
[ci, C2, C3]~[7r/2, 0, 0]. At first it may seem that the Tni 
([5|) is not a very sharp measure but in fact the volume of 
nonlocal gates with >90% fidelity to Xideai is just 11.6% 
[2^ . For Xexp, this volume shrinks to 4.85%, due to de- 
coherence. 

We can now define a new process distance measure — 
the nonlocal distance, A„;=-^ Ac\ + Ac\ + Ac2, where 
A„z£[0, tt] and Aci is the difference of the coordinates of 
the target unitary gate and the coordinates for the max- 
imum overall fidelity Tni ( ci , C2, C3)niax obtained from the 



optimization, Eq. ([5]) [2g|. According to [2l|, A„; meets 
all distance measure criteria for pure processes and can 
be seen as a diagnostic measure. Our experimental gate 
is located at a distance A„;~0 from the ideal CZ. The 
uncertainty in this value is dominated by the numeri- 
cal optimization. Upon closer inspection, this does not 
come at a surprise; any uncorrelated noise process such as 
depolarization or dephasing would show up as mixture, 
i.e. an uniform decrease of Tni over the whole non-local 
space. Imperfections in optical components such as the 
PPBS central to our gate similarly lead to mixing because 
the underlying 2-qubit operations for reflectivity values 
around r?idea/=l/3 are non- unitary [2^. A similar argu- 
ment holds for other imperfections, such as temporal or 
spatial mode mismatch [30| . In other words, in contrast 
to the CU gates demonstrated in 0], our optical setup 
does not have any non-local unitary degrees of freedom. 

We conclude that the nonlocal properties of our gate 
could not be improved by any unitary corrections. We 
therefore attribute the remaining reduction in fidelity to 
mixing, which is supported by the measured, non-ideal 
process purity of 89.8 ± 0.4%. 

This method, the projection of a quantum process into 
the nonlocal, unitary 2-qubit operator sub-space is likely 
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FIG. 4: a) Tetrahedron depicting the space of all nonlocal two-qubit unitary gates. All local gates u®v, like Gxx and Gtt, 
are represented by [0,0,0]; SWAP is located at [0, 7r/2, 7r/2]. The shaded volume represents the space of all perfect entanglers 
[20l |. i.e. gates that can turn separable states into Bell states. The (thick blue) line [7,0,0] contains all Gaa gates, which are 
equivalent to CU [3]. Ghh is located at its midpoint [k/2, 0, 0], and is therefore the only symmetric matchgate which is maximally 
entangling, b-c) Point-by-point locally optimized process fidelity map, Eq.O of all ideal nonlocal gates with b) the ideal target 
gate and c) the experimentally reconstructed Ghh process matrix. The maximum fidelity is 94.7 ± 0.3% at ~[7r/2, 0, 0]. 



to have a number of interesting applications in quantum 
information processing, e.g. process tomography, a topic 
which is rapidly increasing in importance as quantum 
computer architectures evolve. Suggested Hnes for fur- 
ther research are, for example, how different noise pro- 
cesses influence nonlocal properties of a quantum gate 
and nonlocal process discrimination [2^. 

In conclusion, we have implemented a matchgate cir- 
cuit which allows universal computation when combined 
with the simple two-qubit SWAP. We characterized our 
gate using quantum process tomography and illustrated 
the fidelity overlap of the experimental process with all 
possible nonlocal gates in the Weyl chamber. 

We wish to thank B. P. Lanyon and N. K. Langford for 
valuable input. We acknowledge financial support from 
the Australian Research Council Discovery and Federa- 
tion Fellow programs and an lARPA-funded U.S. Army 
Research Office contract. SR acknowledges financial sup- 
port by the FWF project CoQuS No. W1210-N16. AMS 
acknowledges support by the Natural Sciences and En- 
gineering Research Council of Canada, Quantum Works 
and the Canadian Institute for Advanced Research. 



* These authors contributed equally to this work. 
[1] M. Nielsen and I. Chuang, Quantum computation 
and quantum information (Cambridge University Press, 
2000). 

[2] J.L. Dodd et al, Phys. Rev. A 65, 040301(R) (2002). 
[3] B. Lanyon et al, Nature Physics 5, 134 (2008). 
[4] T. Monz et al, Phys. Rev. Lett. 102, 040501 (2009). 
[5] L. Vahant, SIAM Journal on Computing 31, 1229 (2002). 
[6] B.M. Terhal and D.P. DiVincenzo, Phys. Rev. A 65, 
032325 (2002). 



[7] R. Jozsa and A. Miyake, Proceedings of the Royal Society 

A 464, 3089 (2008). 
[8] F. Verstraete, J.I. Cirac, and J.I. Latorre, Phys. Rev. A 

79, 032316 (2009). 
[9] G. Vidal and CM. Dawson, Phys. Rev. A 69, 010301(R) 

(2004). 

[10] J. Zhang et al, Phys. Rev. Lett. 93, 020502 (2004). 

[11] J.L. O'Brien and G. Pryde et al, Nature 426, 264 (2003). 

[12] S. Gasparoni et al, Phys. Rev. Lett. 93, 020504 (2004). 

[13] A. Barenco et al, Phys. Rev. A 52, 3457 (1995). 

[14] N.K. Langford et al, Phys. Rev. Lett. 95, 210504 (2005). 

[15] N. Kiesel et al, Phys. Rev. Lett. 95, 210505 (2005). 

[16] R. Okamoto et al, Phys. Rev. Lett. 95, 210506 (2005). 

[17] A. Fedrizzi et al. Opt. Express 15, 15377 (2007). 

[18] J.L. O'Brien et al, Phys. Rev. Lett. 93, 080502 (2004). 

[19] B. Kraus and J.I. Cirac, Phys. Rev. A 63, 062309 (2001). 

[20] J. Zhang et al, Phys. Rev. A 67, 042313 (2003). 

[21] A. Gilchrist, N.K. Langford, and M. A. Nielsen, Phys. 
Rev. A 71, 062310 (2005). 

[22] T.C. Ralph et al, Phys. Rev. A 65, 062324 (2002). 

[23] A. Laing, T. Rudolph, and J.L. O'Brien, Phys. Rev. Lett. 
102, 160502 (2009). 

[24] Relaxing the condition det{A)=det{B) would allow 
Gix=SWAP, where I is the 1-qubit identity. 

[25] CNOT = |0>(0|®I+|1>(1|®X, CU = |0)(0|®I + |l)(l|(g)U 

[26] See EPAPS supplementary material at [URL will be in- 
serted by AIP] for details 

[27] With the exception of the C3=0 plane, which is symmetric 
around the line [n/2, 0, 0] [tv/2, 7r/2, 0]. 

[28] The process fidelity has a range of £ [0.25, 1]. 

[29] This distance, can of course be calculated between any 
two points in the chamber. 

[30] In fact, most imperfections can be modeled as incoherent 
sums of different unitary processes, which would lead to 
several local fidelity maxima in the Weyl chamber. 



ADDITIONAL ONLINE MATERIAL 

We briefly review the universality proof given by R. 
Jozsa and A. Miyake in 7], which involves showing how 
one can construct an already known universal set from 
Gab +S WAP. Because Gab is intrinsically a two-qubit gate 
and does not allow a change of parity of the input state, it 
is not straightforward to construct arbitrary single-qubit 
unitaries. This is solved in Q by encoding the logical 
qubits |0)^, into two physical qubits which have the 
same parity |0)^=|00), |1)^ = |11). A symmetric Gaa act- 
ing on the two physical qubits then performs the single- 
qubit unitary A on the logical qubit, Fig. [5^). This 2- 
qubit encoding is universal with next-next-nearest neigh- 
bor interactions, whereas a more complicated 4-qubit en- 
coding, which requires the non-trivial Gzx, allows univer- 
sal quantum computing with only next-nearest neigbour 
interactions 01 ■ 

Next, we need an entangling gate which can act be- 
tween two encoded logical qubits. This can be achieved 
by adding SWAP, because, as illustrated in Fig. EJj), 



CZ=GhhXSWAPxGxxXGhh, where X denotes the ax Pauh 
operator. In summary, the minimal set of matchgates 
needed for universal quantum computing is Ghh and Gtt, 
which implement the single-qubit gates H and T on the 
encoded dual-qubit space, and Gxx, Ghh and swap, which 
together form a CZ. 
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FIG. 5: Matchgate universality proof 7]. a) Symmetric 
matchgates Gaa act on two physical qubits which represent 
one logical qubit. b) CZ can be constructed using symmet- 
ric matchgates and SWAP. Dotted ellipses represent logical 
qubits. 



